A segment-based C0 adaptation scheme for PMC-based noisy Mandarin speech recognition

نویسندگان

  • Wei-Tyng Hong
  • Sin-Horng Chen
چکیده

A segment-based C0 (the zero-th order of cepstral coefficient) adaptation scheme for PMC-based Mandarin speech recognition is proposed in this paper. It incorporates a new C0 model of speech signal into the PMC method to improve the gain matching between the clean-speech HMM models and the current noise model. The C0 model is constructed in the training phase by jointly modeling the normalized C0 with other MFCC recognition features to form C0-normalized HMM models. In the testing phase, it pre-segments the input utterance into syllablelike segments, performs C0-denormaliztion operations to expand the C0-normalized HMM models, and uses them in the PMC method. Compared with the conventional PMC method, the proposed method can achieve a much better noise compensation effect due to the use of more precise gain matching in the PMC model combination. Experimental results showed that the basesyllable accuracy rate was significantly upgraded for continuous noisy Mandarin speech recognition.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Efficient Pmc for Robust Speech Recognition in Noisy Environments

The environment adaptive methods play an important part in improving the robustness of automatic speech recognition. In this paper, PMC is reviewed and improved to achieve the better performance. The experiments have been done based on the Cambridge’s HTK toolkit to implement the continuous Mandarin digit recognition in noisy environments

متن کامل

A novel use of residual noise model for modified PMC

In this paper, a new approach based on model adaptation is proposed for acoustic mismatch problem. A specific bias model—residual noise model—is presented, which is the joint compensation model for additive and convolutive bias. The novel noise model is estimated on the basis of maximum likelihood manner. In conjunction with the Parallel Model combination (PMC), it is effective for noisy enviro...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

A robust RNN-based pre-classification for noisy Mandarin speech recognition

This paper addressed the problem of speech signal preclassification for robust noisy speech recognition. A novel RNN-based pre-classification scheme for noisy Mandarin speech recognition is proposed. The RNN, which is trained to be insensitive to noise-level variation, is employed to classify each input frame into the three broad classes of initial, final and pure-noise. An on-line noise tracki...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999